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(57) Abstract: The present invention relates to meth- 
ods for the identification of novel hetero-associating 
coiled-coil peptides and uses of these peptides for het- 
ero-dimerizalipn of fusion proteins. It fuithenhpre re- 
lates to vectors, host cells useful for the production of 
these novel hetero-association peptides and (poly)pep- 
tides/prpteihs comprising these peptides. 
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For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette, 
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HETERO-ASSOCIATING COILEP-COIL PEPTIDES 

The present invention relates to methods for the identification of novel hetero- 
associating coiled-coil peptides and uses of these peptides for hetero-dimerization of 
fusion proteins. It furthermore relates to vectors, host cells useful for the production of, 
these hovel hetero-association peptides and (poly)peptides/proteins comprising these 
peptides, . 

increasingly, there Is a need for proteins which cbmbine two or more functions, such as 
binding to two different specificities, or binding and enzymatic activity, in a single 
structure. TypicaHy, proteins which combine two or more, functions are prepared either 
as fusion proteins or through chemical conjugation of the component functional 
domains. Both of these, approaches suffer from disadvantages. Genetic "single chain- 
fusions suffer the, disadvantages that (i) only a few {two to three) proteins can be fused 
(1), (ii) mutual interference between the component domains may hindei- folding, and (III), 
the size of the fusion protein may make it difficult to prepare. The alternative, chemical 
cross-linking in wfro following purification of independently expi-essed proteins, is difficult 
to control and invariably leads to undefined products and to a severe loss in yield of 

functionarmaterial. '■■■■■■K- • 

A third approach takes advantage' of using genetic fusions of functional to association 
domains which lead to a self-association on co-expression i^ appropriate host cells. To 
assemble at least two:different fragments^ fused to association domains, Vnk domains, 
must have a tendency to form hetero-multimers. In one approach; a natural protein or 
protein domain was dissected and fused to protein partners to achieve hetero- 
association of the fusion proteins via the reconstitution of the native-like structure of the 
dissected protein or protein domain (WO 96/13583). 

In principle, hetero-association can be achieved with complementary helices such as the 
hetero-dimerizing Jun and Fos zippers of the AP-1 transcription factor (2) or other helical 
coiled-coil structures which are involved in the oligdmerization of a wide variety of 
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proteins. Because of their small size and structural regularity, they have also been used 
as artificial domains to mediate oligomerization of various proteins (3, 4). For example, 
the association of two separately expressed scFv antibody fragments by C-terminally 
fused amphipathic helices in vivo provides homo-dimers of antibody fragments in £. coli 
(WO93/15210; 5, 6). Coiled coils consist of two or more amphipathic helices wrapping 
around each other with a slight supercoil. they contain a characteristic heptad repeat 
(abcdefg)n with a distinct pattern of hydrophobic and hydrophilic residues (Fig. 1A. (7, 
8)). The positions a and d, which form the hydrophobic Interface between the helices, 
are usually aliphatic and a have profound effect on the oligomerization state (9, 10). The 
positions b, c. e, g, and f are solvent-exposed and usually polar. The positions e and g, 
which flank the hydrophobic core, can make Interhelical interactions between gi and e:i.^5 
residues, and thereby mediate heterospecific pairing (11-.14). 

As most naturally-occurring coiled coils are homodimeric, synthetic, sequences have 
been designed to promote specific hetero 'oligomerization (11, li3-1 5). hiowever, It was 
observed that designed coiled coils which behave- well as synthetic peptides failed in 
fusion proteins expressed in E coli, as they were proteolytically degraded. 

Thus, the clear disadvantage of association domains based on hetero^associating 
helices is their pseudo-symmetry and their similar periodicity of hydrophobic and 
hydrophilic residues. This structural similarity resulted in a strong tendency to form. 
. homo-dimers and thus to lower significantly the yield of hetero-dimers (2. 16).. 
Furthermore, the formation of Jun/Fos hetero-dimers is kihetically disfavoured, and 
requires a temperature-dependent unfolding of the kinetically favoured homo-dimers, 
especially Jun/Jun homo-dimers (WO 93/15210; 2. 16). Because of the need for 
additional purification steps to separate the unwanted homo-dimers from hetero-dimers 
and the resulting decrease in yield, hetero-association domains based on amphipathic 
helices have so far not resulted in practical advantages compared to conventional 
chemical coupling. 
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Further, it is not currently possible to predict sequences of coiled coil-forming peptides 
that will simultaneously have high stability and heterospecificity as well as advantageous 
in-vivo properties, such as resistance to proteases. This is crucial to practical 
applications of optimal interacting heterodimers for in vivo studies of protein 
oligomerization, e.g. the design of bispecific miniantibodies (17). 

Thus, the techriical problem undertying the present inventiph is to provide association 
domains based on helical coiled-coil structures which lead to hetero^association. The 
solution to the above technical problem is achieved by the embodiments characterized 
in the claims. Accordingly, the present invention provides a method which allows to 
identify hetero-associating (poly)peptides. The technical approach, i.e. the design of an 
appropriate coiled-coil library and screening by using a library-vs^iibrary approach is 
neither provided nor suggested by the prior art. 

Thus, the present invention relates to a method' for the identification of hetero- 
associating (poly)peptides comprising the steps of: 
■ . (a) providing a library A of (poly)p.eptides/proteins comprising (poly)peptides A„, 
havingthe general formula: 
. . VAQLXEXVKTLXAXZYELXSXVQRLXEXVAQL 

wherein X represents a mixture of E, K,.Q, and R, and wherein Z represents a 
mixture of N and V, 

(b) providing a library B of (poly)peptides/proteins comprising (poly)peptides Bn 

having the general formula: 
VDELXAXVDQLXDXZYALXTXVAQLXKXVEKL 

wherein X represents a mixture of E; K, Q. and R, and wherein Z represents a 
mixture of N and V; 

(c) combining in a common medium the (poly)peptides/proteins of said libraries A 
and B; and 

(d) screening or selecting for a screenable or selectable property caused by the 
hetero-association of a (poly)peptide Am with a (poly)peptide Bn. 



wo 01/00814 



PCt/EPOO/05922 



4 

The term "(poly) peptide" relates to molecules consisting of one or more chains of 
multiple, i. e. two or more, amino acids linked via peptide bonds. 
The term "protein" refers to (poly)peptides where at least part of the (poly)peptide has or 
is able to acquire a defined 'three-dimensional arrangement by forming secondary, 
tertiary, or quaternary structures within and/or between Its (poly)peptide chain(s). This 
definition comprises proteins such as naturally occurring or at least partially artificial 
proteins, as well as fragments or domains of whole proteins, as long as these fragments 
or domains have a defined three-dimensionar arrangenpient as described above, 
In this context, the commonly known one-letter code for amino acid residues is used. 
The term " screenable or selectable property" refers to a property which is gerierated in 
the event of a successful interaction taking place during screening or selection." 
Examples for screenable selectable properties include, but are not limited to. binding to 
a target or presentation of a target for ligand:binding. enzymatic activity, transactivation 
of transcription of a reporter gene such as beta-galactosldase, alkaline phosphatase or 
nutritional markers such as his3 and leu, or resistance genes giving resistance, to an 
. antibiotic such as ampicillin, chloramphenicol, kanamycin, zeocin, neomycin, tetracycline 
or streptomycin. In another embodiment, the selectable or screenable property can be 
restoration of phage infectivity to a filamentous phage rendered non-infectious by 
deletion of the N-termina! domain(s) of the genelll protein (U.S. Patent No, 5,514.548). 

In a preferred embodiment,- the invention relates to a method wherein said libraries A 
and B are provided by providing libraries of nucleic acid sequences encoding said 
(pbly)peptides/proteins, followed by causing or allowing the expression of said libraries 
of (poly)peptides/proteins. 

Methods for providing libraries of nucleic acid sequences, and for causing or allowing 
their expression, either in vivo after transformation, transffection or transduction of 
appropriate host cells, using appropriate vectors, containing all elements necessary for 
transcription and translation such as promoter/operator elements etc.. or in vitro using in 



wo 01/00814 



PCT/EPOO/05922 



5 

vitro expression systems are well known to anyone of ordinary skill in the art (see e.g. 
Sambrook et al., 1989, Molecular Cloning: a Laboratory Manual, 2nd ed.). 

Further preferred is a method wherein said common medium are host cells, each cell 
harbouring nucleic acid sequences encoding a (poly)peptide/protein of each of said 
libraries A and B. 

Most preferred is a method wherein said (poly)peptides/proteins of said libraries A and B 
further comprise either a N- 6r a C-termirial fragment of the murine DHFR enzyme, and 
wherein said screenable or selectable property is insensitivity of the host cell to 
trimethoprim by reconstitution of the DHFR enzyme on hetero-association of 
,(poly)peptides Am and Bn- 

The DHFR assay has been published (WO 98/341 20; 1 9) and is further exemplified in 
the examples. 

In another embodiment, the present invention relates to a hetero-associating 

(poly)peptide Am taken from the list of: 
WINZIPA1 : VAQLEEKVKTLRAQNYELKSRVQRLREQVAQL 
WINZIPA2: VAOLRERyKTLRAQNYELESEVQRLREQVAQL 
WINZ|PA3:VAQLQEKVKTLRARNYELKSEVQRLEEKVAQL 
WlNZfPA4: VAQLEEQVKTLQARNYELKSKVQRLKEKVAQL 
WINZIPA5: VAQLEERVKTLRAQNYELKSKVQRLEEQVAQL 
WINZIPA6: VAQLEEQVKTLEAENYELKSKVQRLRERVAQL 
WINZIPA7: VAQLQEQVKTLEAQNYELESEVQRLKEQVAQL 
WINZIPA8: VAQLEERVKTLKAENYELESEVQRLKERVAQL 
WINZIPA9:VAQLEEKVKTLKAKNYELKSKVQRLKEKVAQL 
WINZIPA1 0: VAQLQEEVKTLQAENYELRSEVQRLEEEVAQL 
WINZIPA1 1 : VAQLRERVKTLRARNYELQSKVQRLKERVAQL 
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Furthermore, the present invention relates to a hetero-associating (poly)peptide Bn tal<en 
from the list of: 

WINZIPB1: VDELQAEVDQLQDENYALKTKVAQLRKKVEKL 
WINZIPB2: VDELKAEVDQLQDQNYALRTKVAQLRKEVEKL 
W1NZIPB3: VDELEAEVDQLKDQNYALKTKVAQLQKQVEKL 
WINZIPB4: VDELRAKVDQLQDENYALETEVAQLQKRVEKL 
WINZIPB5: VDELEAEVDQLEDQNYALQTRVAQLEKRVEKL 
WINZIPB6: VDELKAKVDQLKDKNYALRtKVAQLRKKVEKL 
WINZIPB7: VDELRAQVDQLQDKNYALRTRVAQLKKRVEKL 
WINZIPB8: VDELQAEVDQLQDQNYALRTQVAQLKKKVEKL 
WINZIPB9: VDELRAQVDQLEDQNYALETQVAQLEKEVEKL ■[ ' 
WINZIPB10: VDELQAKVDQLKDENYALQTKVAQLQKRVEKL 
\A/INZiPB1 1 : VDELRAEVDQLEDENYALRTRVAQLRKQVEKL 

Particularly preferred is the use of a hetero-associating (poly)peptide according to the 
present invention for the identification of optimized hetero-associating (poly)peptides in a 
method according to the present invention, wherein one of the hetero-associating 
peptide WinZipAm as listed tiereinabove is used instead of library A of 
(poly)peptides/proteins comprising (poly)peptides Am In step (a) above, or wherein a 
hetero-associating peptide WinZip Bn as listed hereinabove is used instead of library B 
of (poly)peptides/proteins comprising (poly)peptides Bn in step (bj avove. 

In a still further preferred embodiment, the present invention relates to an optimized 
hetero-associating (poly)peptide obtainable by the method of the present invention. 

In a most preferred embodiment, the present invention relates to a pair of hetero- 
associating (poly)peptides taken from the list of: 
WinZipAl and WinZipBI 
WinZipA2 and WinZipBI 
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WinZipAl and WinZip B2 
WinZipAS and WinZip B3 
WinZipA4 and WinZip B4 
WinZipAS and WinZip 85 
WInZipAS and WinZip B6 
WinZipA? and WinZip B7 
WinZipAS and WinZip B8 
WinZipAS and WinZip B9 
WinZipAl 0 and WinZip BIO 

WinZipAH and WinZip B11 . . 

In a yet further preferred embodiment,, the invention relates to a (poly)peptide/protein 
comprising one of the hetero-associating (poly)peptides, or an: optimized .hetero- 
associating (poly)peptide of the present invention, and a further (poly)peptide/protein. 

In that context, "(poly)peptide/protein comprising one of the hetero-associating 
(poly)peptides, or an optimized hetero-associatlng (poly)peptide of the present invention, 
and a further (poly)peptide/protein" refers to all constructs which comprise one of the 
hetero-association peptides according to the present invention and additional moieties. 
This comprises (poly)peptides/proteins which are expressed fronri a contiguous nucleic 
acid coding sequence. Additionally, this comprises constructs where the individual 
components are expressed from different nucleic acid coding sequences, or where the 
components are produced by peptide synthesis, and where the separate components 
are linked by the formation of disulfide bonds or by chemical conjugation. 

Still further preferred is a (poly)peptide/protein wherein said further (poly)peptide/protein 
is an enzyme, a toxin, a cytokine, a metal binding domain, a transcription factor, a 
member of the immunoglobulin superfamily, a bioactive peptide of 5 to 15 amino acid 
residues, a peptide hormone, a growth factor, a lectin, a lipoprotein, a peptide which is 
able to bind to an independent binding entity, or a functional fragment of any said further 
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(poly)pepticle/protein. 

In a most preferred embodiment, the present invention relates to a hetero-associated 
(poly)peptide/protein comprising at least two (poly)peptide/proteins of the present 
invention, associated by hetero-association of a hetero-associating (poly)peptide Am and 
a hetero-associating (poly)peptide Bn. 

The term "hetero-associated (poly)peptide/protein" refers to all bispecific and/or bivalent 
complexes formed by taking advantage of the hetero-associating peptides according to 
the present invention. These include, for example, constructs where said "further 
(pdly)peptides/proteins" are two antibody fragments directed against different 
specificities. Such bispecific constructs can be used to increase selectivity of antibody- 
based approaches in therapy of diseases where the target cells exhibit a pattern of two 
cell-surface markers distinct from that of non-target cells which may present one of the 
two markers. Furthermore, one of the antibody specificities may be directed to a target 
cells, whereas the second may be used to target a drug carrier moiety selectively to the 
target. Additionally, antibody fragments as targeting vehicles may be combined with 
(poly)peptides/proteins which serve as effector domains, such as enzymes or signalling 
molecules. ' 

Particularly preferred is a DNA sequence encoding a hetero-associating (poly)peptide 
taken from the list of WINZIPAI to WINZIPAil and WjnZipBI to WinZipBII, or 
encoding an optimized hetero-associating (poly)peptide or a (poly)peptide/protein of the 
present invention. 

Further preferred is a PNA sequence encoding a hetero-associating (poiy)peptlde 
wherein said DNA sequence hybridizes under stringent conditions to a* DNA sequence 
encoding a hetero-associating (poly)peptide taken from the list of WinZipAl to 
WinZipAl 1 and WinZipBI to WinZipB1 1 . 
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As used herein, the term "hybridizes under stringent conditions" is intended to describe 
conditfonsfor hybridization and washing under which nucleotide sequences at least 60% 
homologous to each other typically remain hybridized to each other. Preferably, the 
conditions are such that at least sequences at least 65%, more preferably at least 70%. 
and even more preferably at least 75% homologous to each other typically remain 
hybridized to each other. Such stringent conditions are known to those skilled in the art 
and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, New 
York. (1989). 6.3.1-6.3,6. A preferred, non^imiting exarriple of stringent hybridization 
conditions is hybridization in 6 x sodium chloride/sodium citrate (SSC) at about 45°G. 
followed by one or more washes in 0.2 X SSC. 0.1 % SDS at SO^-es^C. 

in still another ertibodiment the invention relates to a vector comprising a DNA sequence 
according to the invention. 

In a further embodiment, the inventiori relates to a vector comprising DNA sequences 
encoding at least two (poly)peptide/, comprising at least a hetero-associating 
{poly)peptide Am and a hetero-associating (poly)peptide Bn. 

Vectors which can be used in accordance with the present invention are well-known to 
the practitioner in the art. 

In another embodiment, the invention relates to a host cell containing at least one vector 
of the present invention. 

In a further preferred embodiment the host cell is a mammalian, preferably human cell, a 
yeast cell, an insect cell, a plant cell, or a bacterial, preferably E.coli cell. 

In a highly preferred embodiment, the invention relates to a method for the production of 
a hetero-associating (poly)peptide, an optimized hetero-associating (poly), a 
(poly)peptide/protein, or a hetero-associated (poly)peptide/protein of the present 
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invention, which comprises culturing the host cell of the present invention in a suitable 
medium, and recovering said (poly)peptide or said (poly)peptide/protein produced by 
said host cell. 

In a most preferred embodiment, the invention relates to a pharmaceutical composition 
comprising the hetero-assoclated (poly)peptide/protein or the present invention. 

Still further preferred is a diagnostic composition comprising the hetero-associated 
(poly)peptide/prote|n of the present invention. 

Further preferred is a kit containing at least one of 

a hetero-associating (poly)peptide. an optimized hetero-associating (poly)peptide. or a 
(poly)peptide/proteih of clairris. or a hetero-associated (poly)peptlde/protein of the 

■■ present bventipn; or ' : ' 

a vector of claims according to the invention, 
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FIGURE CAPTIONS 

Figure 1: (A) Schematic representation of a parallel dimeric coiled coil. Hatched bars 
indicate the possible interhelical interactions between e and g positions. (B) Schematic 
representation of the protein complementation assay as described in the Examples, 
Introduction of a mutation at the DHFR interface (11 14A) was used to increase selection 
stringency (19). (C) Overview of the library design depicted as a helical wheel plot from 
the N- to the C-terminusf (inside to outside. The randomized positions are indexed (*: 
equimolar mixture of Q. E, K, R. x: equimblar mixture of N. V). The selected residues 
from the predominant pair. WinZip-AI B1 (clone #1 . Fig. 1 D). are next to the randomized 
positions in the respective box. (D) Partial sequences (randomized positions) of clones 
from the highest stringency selection which survived at least until passage 10 (clone #1: 
WinZip-A1B1, clones #2 to #10: WinZip-A3B3 to WinZip-AHBII). Clone #1 (named 
WinZip-A1B1) was identified 18/22 times in passage 12 and 4/11 times in passage 10 
(the full amino acid sequences are shown, in Fig, 1 1). . 

Figure 2: (A) DNA constructs code for fusions between library proteins (shown as a- 
helical leucine zippers) and either fragment of murine DHFR (mDHFR). Fusions were 
created using either the wild-type or the mutant mDHFR fragment 2 (llel 14Ala), yielding 
LibA-PHFRlii and LibB-DHFR[ 2] or LibB-DHFR[2:l114A], respectively. (B) Principle of 
the mDHFR-fragment complementation assay: £. co// cells are cotransformed with both 
fusion libraries in minimal medium, in the presence of IPTG (for induction of expression) 
and trimethoprim (for inhibition of the bacterial DHFR). If the library proteins 
heterodimerize. mDHFR can fold from the Individual fragments resulting in active 
enzyme and bacterial growth. Both mDHFR fragments must be present, and 
dimerization Of the fused proteins is essential. In order for cell propagation to be 
possible. No growth is obsen/ed if any of these conditions is not fulfilled (19) . The 
surviving colonies are the result of "single-step selection" and can be directly analyzed 
by DNA sequencing. (C) ^'Competition selection" is undertaken by pooling colonies from 
(B) In selective, liquid culture (passage 0 or PO). propagating the cells and diluting into 
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fresh selective medium lor further passages. An aliquot can be plated and the resulting 
colonies analyzed by DNA sequencing. 

Figure 3: Competition selection and chain shuffling. (A) Approximately 1.42 x 10^ clones 
resulting from single-step. l114A-mutant selection were pooled (=P0) and competition 
selection was undertaken as described in Figure 2C. and In the Examples. At each 
passage, some cells were plated and colony sizes were quantitated. (B) Quantitation of 
the colony sizes from. (A). For. comparative purposes, quantitation of colony sizes of cells 
transformed with DNA of WinZip-A1 Bt (but not passaged in liquid culture) is shown. (C) 
Quantitation of the colony sizes from passages of the chain shuffling experiment: 
WinZip-B1-PHFR[2:l1 14A] ^ LibA-DHFRlt]. In (B) and (C) the numbers of colonies were 
normalized such that passages could be directly cornpared. 

Figure 4: (A) Schematic representation ot a leucine zipper pair visualized from the N- 
terminus illustrating e/g-interactions and the hydrophobic core formed by the a- and d- 
positions (B) Distribution of residues at the semi<andomized positions throughout 
selection. The number of zipper pairs sequenced is given in parentheses, save. "Before 
selection" where the theoretical distribution is reported. Each pair carries one core a-pair 
and 6 e/g-pairs. Neutral e/g-pairs have one or both residues as Gin. In "Competition 
(l114Ar only clones from P6 to P12 (not from earlier passages) were considered for 
analysis. Thus. 37 individual clones were sequenced, and most of the resulting 
sequences were identical in two or more clones. The distributions, were calculated 
. according to the frequency of sequence occurence (n=37). (C) Leucine zipper 
sequences obtained after competition selection and chain shuffling. The heptad 
positions (a to g) are followed by the heptad number (1 to 5). Invariant residues from 
GCN4 are underlined. Clear boxes indicate the semi-randomized e- and g-positibns and 
core a-position (aS). Circled residues were designed to contribute to helix capping. 
Shaded residues were designed for the introduction of restriction sites. Other residues 
are from c-Jun (LibA) or c-Fos (LibB). Arrows indicate putative e/g-interactions. 
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Figure 5: Efficiency of competition in a model selection. The selection was set up by 
mixing known numbers of cells expressing either GCN4-DHFR[1]/GCN4-DHFR[2:I1 14A] 
fusions or one of 7 LibA-DHFR[1]/LibB-DHFR[2:l114A] pairs previously selected by 
single-step selection. The starting ratio was 2.9 x 10^: 1(GCN4 to Lib). Competition 
selection was undertaken as described in Figure 2C, and in the Examples. The 
appearance of the library pairs in the pool was monitored by restriction analysis. A PvuW 
fragrnent (1138 bp) is unique to the LibB sequence of the LibB-DHFR[2] plasmid, while 
another (762 bp) is from pRep4 (repressor plasmid) and remains approximately 
constant. The bands were quantitated using the NIH Image gel; aiialysis function to 
calculate the ratio of LibB/pRep4 (indicated beiow each lahe). 

Figure 6: (A) Deviation of obsen/ed e/g-interactibns in the seieeted heterodlmer and the 
two putative homodimers from the statistically expected distribution in the absence of 
selection. Interactions are grouped in potentially attractive (E-K, K-E,.E-R, R-E; left black 
bars), neutral (Q Q, Q-E, E-Q, Q-K, K-O, Q-R, R-Q; grey bars) ^ind repulsive (E:E, K-K, 
K R, R-K, R-R; right black bars), (i) Low stringency selection: clones were subdivided 
into those with an N-N pair in the core a-pos!tion (n=8) and those. with an N-V or V-V'pair 
(n=6), (ii) medium stringer^oy selection: only clones with an Ash-pair in the core a- 
position were considered (n=23), and (iii) highest stringency selection: clones, which 
survived the competition selection at least up to passage 10 (f^ig. 1D) were considered. 
These were analyzed counting each sequence once (not weighted, n=10) or according 
to their f requence of appearance (weighted, n=37). (B) Number of selected pairs having 
a certain difference of attractive (grey bars) or repulsive interactions (black bars), 
respectively, between the heterodimer and its constituting homodimers. The definition of 
(i). (ii) and (Iii) is as in (A). 

Figure 7: Positional distribution of amino acids at eiach e- and g-position in sequences 
obtained from the highest stringency selection. The statistically expected random 
occurrence of each amino acid at each position was subtracted from the relative 
occurrence observed in the selection (Q left (black), E middle (grey), K/R right (black)). 
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Figure 8: (A) Determination of the molecular weight of WinZip-AIBI by sedimentation 
equilibrium. The upper panel shows the residuals between measured data obtained at 
10 /jM peptide concentration at 25''C and data fitted as monomer (top), dimer (middle), 
or trimer (bottom). The lower panel shows the residuals for a dimeric fit to the data set 
obtained at 150 jjM peptide concentration, IC'C. (B) Native gel electrophoresis. 1: 
WinZip-AI (pi 11.2); 2: WinZip-BI (pi 4.9); 3: WinZip-AIBI. (C) Urea titration of the 
heterodimer WinZip-AIBI (■), and the homodimers WinZip-AI. (a) and WinZip-B1 (▼). 

Figure 9: CD-measurements of the synthesized peptides of VVinZip-AIBI. (A) 
Temperature dependence of [3)222 for WinZip-A1B1 (■), WinZip-AI (a), WinZip-B1 (t), 
arid the calculated average of both homodimers (- - (B). Dependence of T^ and ATm 
-) on pH, and (C) on salt. It must be rioted that thermal defiaturation of WinZip-AI 
was not cdrnpletely reversible at 1 M salt concentration, and only 71% of the starting 
signal was regained. 

Figure 10: Sequencing profile of pools from passages of the chain shuffling WinZip-BI- 
DHFR[2:|114A] + LibA-DHFR[1]. Representative semi-randomized positions (see Fig. 4) 
were taken from a single competition experiment, such that the selection rates can be 
directly compared. The ratio of the individual triplet codons (central three nucleotides of 
each frame) was visually estimated (GAG = Gin; GAG = Glu; AAG = Lys; GGT = Arg; the 
equimolar random mix of the 4 codons results in the predominance of C at the first 
position, A at the second and G at the third). Mixed positions are marked by (NNN), 
positions where a single codon is dominant (> 50%) are marked in lowercase and those 
where the codon is clear (> 90%) are marked in upper case. For passages 0, 2 and 8, 
two independent sequencing reactions were performed, which yielded identical results. 

Figure 11 : (A) Sequences of (poly)peptides WinZipAl to WinZipAl 1 

(B) Sequences of (poly)peptides WinZipBI to WihZipBII 
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The examples illustrate the invention: 
EXAMPLES 

All reagents used were of the highest available purity. Sequencing was carried out either 
by cycle sequencing with fluorescence labeling (MWG-Biotech, Ebersberg, Gentiany) 
using a LiCor detection system or by automated sequencing with an ABI sequencer, 
Restriction endphucleases and DNA modifying enzymes were from PMarniacia and New 
England Biolabs. E. co// strain XLI-Blue (Stratagene) was used for subcloning .and 
propagation of the libraries. E. coli strain BL21 harboring the lacP plasmid pRep4 
(Qiagenj was cotransformed with the appropriatp DNA constructs for the survival 
assays. - ■ ;- ■.; .• ■ 

Abbreviations: - CD, circular dichroism; mDHFR, murine dihydrofolate reductase; 
WinZip: dominaht zipper pairs obtained from competition selection; WinZip-A1B1: 
original pair selected. comprising peptide A1 from libraryA and peptide Bi from libraryB; 
WiriZip-Al B2 and WinZip-A2B1 : optimized pairs comprising the original partner A1 or B1 
and the new partner B2 or A2, respectively. . 

Example 1 : Selection of hetero-associatlon peptides (see WO 98/34120, Example 

7). ■ ■ ■' . ■; - . • ■. ^ . . . y 

1 .1 Introduction 

Here we present a strategy for library- i^s-library screening in intact cells based on the 
folding of murine enzyme dihydrofolate reductase (mDHFR) from complementary 
fragments (18, 19). DHFR was genetically dissected into two rationally designed 
fragments, each of which can be fused to a library of proteins or peptides (Fig. 2A). 
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Members of one library which heterodimerize with a member of the other library drive 
the reassembly of the mDHFR fragments, resulting in reconstitution of enzymatic activity 
(Fig. 2B). Activity is detected in vivo using an E. co/Abased selection assay, where the 
bacterial DHFR is specifically inhibited with trimethoprim, preventing biosynthesis of 
purines, thymidylate, methionine and pantothenate, and therefore cell division. The 
reconstituted mDHFR, which is insensitive to the low trimethoprim concentration present 
in selection, restores the biosynthetic reactions required for bacterial propagation. As a 
result, the interaction between library partners is directly linked to cell survival and 
detected by colony formation. We have previously demonstrated the utility of this 
strategy with GCN4 leucine zipper-forming peptides, as well as with larger 
heterodimerizing partner proteins (19) with Kds ranging between 3 and 160 nM (20, 21), 
although the affinity limits have not been determined. 

1.2 Constructs for DHFR fragment complementation 

The DNA constructs encoding the A/-terminal (1-107) and C-terminal (108-186) mDHFR 
fragments have been previously described (19). ThQ vectors are variants of plasmids Z- 
F[1,2], encoding the N-terminal DHFR fragment, and . Z-F[3] or Z-F[3:I1 14A], 
respectively, encoding the C terminal DHFR fragment with or without the II 14A mutation 
(19), Briefly, each fragment was amplified by PGR with appropriate unique flanking 
restriction sites and subcloned into a bacterial expression vector (pQE-32 from Qiagen). 
Each plasmid encodes an N-terminal hexahistidine tag, followed by a designed flexible 
linker and the appropriate DHFR fragment. Unique restriction sites between the 
hexahistidine tag and the flexible linker allow subcloning of the desired library. 

1.3 Library design 

Our goal was to select for metabolically stable dimeric coiled-coils with high 
heterospecificity. Thus, two libraries were designed to meet the requirements of genetic 
diversity to prevent recombination, high helix stability and a high probability of 
complementarity, all within a reasonable library size (Fig. 1C). 

As templates for the outer, solvent exposed residues (positions b, c. f) we chose the 
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leucine-zipper regions of the proto-oncogenes c-Jun and c-Fos for libraryA and B, 
respectively, thus minimizing potential recombination despite the repetitive pattern of the 
heptads. Indeed, no recombination was found in any of the 80 clones sequenced. We 
chose a helix length of 4.5 heptads as good compromise between stability and size. For 
the hydrophobic core residues (positions a, d) we chose the residues of the parallel, 
homodimeric leucine zipper GCN4 (Va| at a. Leu at d). A single a-position in the middle 
of each helix is often occupied by a polar residue, most often an Asn, which forms a 
hydrogen bond inside the hydrophobic core (22, 23). Replacement of this Asn pair by a 
non-polar one increases the stability significantly, but leads to helices packing in 
different registers and orientations, as well as forming higher order oligomers (24-26). 
Since we could not ascertain a priori whether higher specificity or stability would be more 
advantageous, we included both by allowing Asn and Val at the core a-position with 
equal probability. A difficult problem in library design is to encode only the desired amino 
acids with a predetermined ratio. We solved this problem by using defined trinucleotide 
mixtures in the oligonucleotides, where each trinucleotide codes for one specific amino 
acid (27). 

The solvent-accessible residues at the e- arid g-posit|ons can form interhelical salt 
bridges or hydrogen bonds which can contribute to stability and heterorneric specificity 
(13, 14, 28, 29). Based on these results and on commonly occurring amino acids at 
■ these positions (30, 31), we simultaneously randomized all eight e- and g-positions with 
equimolar mixtures of Gin, Glu, Lys, Arg, also using trinucleotide , codons in DNA 
synthesis. Including the Asn-Val combination at the core a-position, each library had a 
theoretical diversity of 1.3x1 0^.. 

To increase the stability of the helices, we introduced helix capping residues in both 
libraries to saturate the missing hydrogen bonds at the helix ends with their side chains. 
Based on studies of helix-capping propensities in proteins (32, 33) and peptides (34, 
35), we chose T-X-X-Q (Ncap-N1-N2-N3) for libraryA. and S-X-X-E for libraryB. The C- 
cap has only a minor effect on helix stability (35). As Gly has a high preference for the 
C-cap position (32. 33), we added a C terminal Gly. This may contribute to helix 
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termination or extend the linker whicli connects tlie coiled coil to the DHFR fragments 
(Fig. 1B). 

1 .4 Library synthesis 

Trinudleotlde coddns (27) were used to code for randomized positions, all other 
positions were made with mononucleotides. 

LibraryA: TACTGTGGCGGAACTGNNNGAANNNGTGAAAACCCtTNNNGCTNNNXXX- 

TATGAACTTNNNTCTNNNGTGAGCGCTTGNNNGAGNNNGTTGCCCAGCTTGCTA . 

(encoding VAQLXEXVKTLXAXZYELXSXVQRLXEXVAQL. wherein X represents a 

mixtui'e of E, K, Q, and and wherein Z represents a mixture of N and V); 

libraryB: CTCCGTtGACGAACTGNNNGCtNNNGTTGACCAGCTGNNNGACNNNXXX- 

TACGCTGTGNNNACCNNNGTTGGCAGCTGNNNAAANNNGTGGAAAAQCTGTGATA 

(encoding VDELXAXVpQLXDXZYALXTXVAQLXKXVEKL, wherein X represents a 

mixture of E, K, Q, and R, and wherein Z represents a mixture of N and V) . 

(NNN = equimolar mixture of the tririucleotldes AAG. GAG, GAG, CGT; XXX = equimolar 

mixture of the trinucleotides AAT. GTT). 

Generation of the second strand and introduction of Sail and Nhel restriction sites, were 
achieved by PGR using the primers prA-fwd: GGAGTACTGGCATGCAGTCGACTACT- 
GTGGCGGAACTG and prA-rev: GGACTAGTACCTTCGCTAGCAAGCTGGGCAAG or 
prB-fwd: GGAGTACTGGCATGCAGTCGACCTCCGTTGACGAACTG and prB-rev: 
GGACTAGTGCTAGCTTGTGACAGCTTTTCCAC, respectively. This resulted in a 142 
bp double-stranded oligonucleotide for either library. 

1.5 Expression plasmids 

LibraryA and B were both digested with Sail and Nhel, gel purified and ligated to the 
appropriate vector (Fig 2) yielding the ptesmids LibA-DHFR[1l. LibB-DHFR[21, LibB- 
DHFR[2:I1 14A] (Fig. 2A). After subcloning, the resulting linl<er between either library and 
DHFR fragment was: A(SGTS)2 StSSGI for LibA and SEA(SGTS)2STS for LibB, To 
achieve maximal library representation, the ligation mixes were individually 
electroporated into XLl-Blue cells and selected with amplclllin on rich medium (LB). A 2- 
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to 7-fold over-representation of each library was obtained. The resulting colonies were 
pooled and the piasmid DNA purified such that supercoiied plasnnid DNA was obtained 
for cotransformation. The supercoiied DNA was cotransformed in BL21 cells yielding 
about 4x10^ double-transformants. We used BL21 cells with a transformation efficiency 
of no less than 5 x 1 0^ transformants per mg of DNA using 200 pg of DNA, or 2 x 10^ 
transformants per mg using 500 ng of DNA. In cotransformatioris, the occurrence of 
double transformation was calculated as the number of colonies growing under selective 
pressure with trimethoprim (described below) divided by the number growing in the 
absence, when cotransformed with equal amounts of each DNA of a given, pre-selected 
pair. In order to verify that the library populations encode the designed amino acids with 
the expected frequency, single clones from each library were randomly picked and 
sequenced before selection. No statistically significant biases were detected. Seventy to 
80% of each library had no mutations or frame-shifts, and thus the library- vs^ibrary 
combination yielded approximately 50% correct sequence combinations. Thus, the 
experimental library-vs-library size of correct pairs is estimated as 2x10®. ; - 

1.6 Selection procedure. 

Three selection strategies were tested here, each having a different level of stringency. 
In the lowest stringency selection, we screened two expressed libraries against each 
other in a single-step selection (Fig. 2B), where cells cotransformed with corpplementary . 
libraries were directly plated on selective medium plates (M9 medium with 1 //g/ml 
trimethoprim), and resulting colonies were analyzed, thereby identifying air interacting 
polypeptide partners. In the second strategy, we increased the selection stringency by 
using a mutant DHFR fragment (lle114Ala) containing the destabilizing I114A mutation 
in DHFR[2] which occurs at the interface between both DHFR fragments. This mutation 
prevents stable reassembly of DHFR from its fragments (19) and should thus require 
more efficiently heterodimerizing, as opposed to homodimerizing, interacting partners to 
drive enzyme reconstitution. Finally, we introduced competitive metabolic selection, 
where clones obtained with the second strategy were pooled and passaged through 
several rounds of competition selection, in order to enrich for the optimally 
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heterodimerizing partners. Thereby, the most stable heterodimers should have higher 
mDHFR activity and thus a growth advantage. 

Selective pressure for DHFR was maintained throughout all steps by inhibiting the 
bacterial DHFR with trimethoprim (1 mg/ml) in minimal medium. Ampicillin and 
kanamycin (100 mg/ml and 50 mg/ml. respectively) were also included in all steps to 
retain the library plasmids and the lacf repressor-encoding plasmid (pRep4), 
respectively. Expression of the proteins was Induced with 1 mM IPTG. When selecting 
on solid rpedium, growth was allowed for 45 hrs at 37*^0. 

When It was necessary to control precisely the starting number of cells in a competitipn. 
the number of viable cells in the starter cultures was quantitated as follows. The 
appropriate clones were propagated in liquid media under selective conditions and dilute 
aliquots vyere frozen at -80°C with 15% glycerol. One, aliqupt for each clone was thawed 
and plated under selective cohditldns. and the colonies counted after 45 hrs. The 
volurne of cells to use for PO was tfien calculated, such that each done should be over^ 
represented by a factor of at least 2000. Colony sizes (in Fig. 3) were evaluated using 
the NIH Image Particle Analysis Facility. When selecting in liquid medium, the starting 
O.D. (600 nm) was either 0.0005 or 0.0001 . Cells were propagated either in Eflenmeyer 
flasks or in a 10 liter New Brunswick fermentor, depending on the volume required to 
ensure adequate representation of all clones present, at 37°C with shaking, or stirring at 
250 RPM. After 1.0 to 24 hrs, O.D. (600 nm) reached 0.2 to 1.0 and cells were 
harvested. 

1.7 Single-Step selection 

As a first step in selection of heterodimerizing leucine zippers, a single-step selection 
was undertaken, using the wild-type mDHFR fragments, by cotransforming the libraries 
LibA-DHFR[ 1] and LibB-DHFR[2] and plating on selective media (Fig. 2B). This strategy 
applies only a low stringency of selection to the potential pairs, thus many library 
combinations were expected to be selected. Approximately 1.7% of the resulting 
ampicillin-resistant cells were doubly transformed, harboring (at least) one plasmid from 
each library when using 5 ng of each DNA. or 8% were doubly transformed when using 
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20 ng of each DNA, as seen from control transformations (calculated as described 
above). Of the doubly transformed cells which harbor no mutations or frame-shifts, 
approximately 35% formed colonies under selective conditions (Table 1). This result 
immediately demonstrates that even with relatively low stringency of selection, only a 
fraction of the possible combinations of the two libraries allows zipper heterodimerization 
leading to efficient mDHFR reassembly, Fourteen colonies resulting from two 
independent cotransformations were picked and the sequences encoding the zippers 
were determined. Even under these low stringency conditions there exist Important 
sequence biases ih these sequences relative to the uhselected ones (Fig. 4B). A 
reduction in same-charged e/g-pairs from 31.3% (unselected) to 19% (selected) and an 
increase in opposite-charged pairs from 25% (unselect6d) to 31% (selected) were seen. 
As well, a strong enrlChnriint of N-N pairing at the core a-pbsltlpn (25% uhselected vs 
57% selected) was observed. The characteristics that have been enriched are 
<3bnslst6nt with the selection of stable leucine zipper heterodlmers. . ; 

1.8 Increased stringency: use of the mDHFR lle114Ala m^ 

We repeated the single-step selection, using the llel 14A|a mutant of mDHFR (18, 19), in 
order to increase the stringency of selection. We reasoned that only library partners that 
form the most stable heterodimers can compensate tor the reduced ability of the 
mDHFR(lle114Ara) fragments to fold into active enzyme, resulting in higher enzyme 
activity and growth rates. When bacteria were cotransformed with LibA-DHFR[1] and 
ybB-DHFR[ 2:I114A], we observed a 5b-fold decrease in the number of colonies upon 
selective plating cornpared to the wild-type DHFR fragments (Table 1): Twenty-five 
colonies were picked from 3 independent cotransformations and the DNA sequences 
were analyzed. The increase In selectivity was concomitant with an extremely strong 
selection for N-N pairing at the core a-posltlon (92%; Fig. 4B). illustrating that the 
specificity of in-register parallel alignment provided by N-N pairing is more highly favored 
under these iprvivo selection conditions than the higher stability afforded by V-V pairing. 
Reassembly of mDHFR from Its fragnfients requires that in the final structure, the two 
fragment N-termini be brought close enough together to allow native-like refolding of 
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DHFR (see Fig. 1) (19, 36). The peptide linkers that connect the library sequences to the 
DHFR fragments must be sufficiently flexible to allow DHFR to fold from its fragments, 
but not so long that any C-terminal to A/-terminal orientation of the final folded leucine 
zipper would be allowed. As a result of this structural requirement, parallel in-register 
heterodimerizaition of the library peptides is the only configuration possible. Other biases 
in these sequences were also more pronounced than with the wt DHFR fragments (Fig. 
4B). In particular, an additional incriease in opposite-charged e/g-pairs from 31% to 37% 
was seen. In one case, a point-mutation resulted in a single clone (1/25) with a V-T pair 
at the core a-position. 

1.9 Competition selection: Efficiency of selection 

To further increase the selection pressure, we applied the principle of competition 
selection. We reasoned that, among selected zipper pairs, those which result in more 
stable heterodimerization will allow the most efficient enzyme reconstitution, leading to 
higher DHFR activity. If DHFR activity is limiting for growth, the higher activity should 
result in more rapid bacterial propagation, hence these cells would become enriched in 
a pool. Thereby, after sequential rounds of growth-competition> subtje differences in 
growth rate can be amplified, increasing the stringency of selection relative to the single- 
step selection. 

To determine the rate at which competition can enrich for particular partner pairs, we 
first set up a model competition with a limited number of clones as described in Figure 
1C. The initial cell mixture (PO) contained known amounts of viable cells expressing 
either GCN4- DHFR[.1]/GCN4-DHFR[2:I1 14A] or one of seven LibA-DHFR[1]/LibB- 
DHFR[2:1114A] pairs previously obtained in a single-step selection of those libraries, 
mixed at a ratio of 2.9 x 10 " : 1 (GCN4 : library clones). Productive association of the 
homodimeric GCN4 pair should occur only 50% of the time versus up to 100% for 
heterodimerizlng library clones, thus is disadvantaged. Within 3 passages, the library 
pairs were already visibly enriched (Fig. 5), and after 5 passages the measured ratio 
between a restriction fragment indicative of the library and a constant fragment from the 
repressor plasmid had reached its maximium, showing that enrichment was maximal. 
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Colonies resulting from passage 9 (P9) were sequenced. No GCN4 leucine zippers were 
present among 24 sequences analyzed. Therefore, enrichment of the library pairs over 
GCN4 by a factor of at least 24 x 2.9 x 10"^ = 7 x 10^ was achieved. Four out of the 7 
library clones initially present survived until P9, with varying distributions (data no 
shown). The experiment was also repeated at a lower starting ratio of GCN4 and the 
same library clones were enriched, consistent with their enrichement being truly the 
result of selection (and not of unrepresentative sampling). This indicated that selection 
among the pre-selected clones was hot as rapid as that seen between pre-selected and 
GCN4 zippers, but that the smaller differences between the pre-selected ones can still 
be amplified in selection. These results demonstrate that there is a direct link between 
reconstitution of mDHFR and growth rate. 

1.1 p Competition selection for optimal pairs 

Our ultimate goal was to select for the "best" among the zipper pairs obtained by single- 
step selection. We obtained a large initial number of clones by cptransforming bacteria 
with 0.5 mg of DNA each fronf? LibA-DHFR[1] and LibB-DHFFi[ 2:11 14A]. Approximately 
50% of cells were at least doubly transformed (52% + 10%, average of 2 Independent 
control experiments, calculated as described in the Experlrinental Protocol). We obtained 
approximately 1.42 x 10"* clones on selective medium, which arise from a 1.4 x 10^-fold 
selection factor (see Table 1), and were thus selected from (1.42 x 10"^) x (1.4 x 10^) = 
2.0 X 10^ library- v/S-library cotransformants. These were pooled and passaged. There 
was a clear increase in colony sizes with subsequent passages, indicating that faster- 
growing clones were taking over (Fig. 3A, B). At PI 2, the colonies are homogeneously 
large, showing similar growth rates among the clones. Twenty-two individual colonies 
from PI 2 were picked and sequenced, as well as 1 1 from PI 0 and 2 from each previous 
second passage. A single pair (WInZip-AIBI, composed of vyinZip-A1-DHFR[1] and 
WinZip-B1-DHFRi2:l114A]) was identified 18/22 times (82%) |n P12, 4/11 (33%) in P10, 
but not in previous passages (Fig. 4C). While other sequences were found in early and 
late passages, none was as enriched as WinZip-A1 B1 . In order to verify that the growth 
rate recorded after competition (PI 2) was independent of bacteria-specific factors 
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resulting from passaging, we cotransformed DNA from a pure clone of WinZip-AI B1 into 
fresh bacteria. The colony size distribution is similar for P12 and for the transformants 
(Fig. 3B), illustrating that the growth rate is a direct product of mDHFR reconstitution 
directed by the WinZip-A1B1 pair. The sequence bias observed at the core-a position 
was yet stronger here: only N-N pairing was recorded at the core a-position. When the 
biases at the e/g-posltions were calculated according to the occurrence of each 
sequence (n=:37), there was no significant change in opposite charged pairing (37?^); 
while a small increase in same-charged pairing was observed (from 23% to 26%) as a 
result of the two same-charged pairs which occur in the predominant WihZip-A1 81 (FigV 
48, C). However, when each unique sequence was considered only once (n=10) a 
furtherincrease of opposite-charged e/g-pairing was observed ; 

Example 2: Analysis of clbnes resulting from library screenin 

2.1 Introduction 

Clones resulting from the three selection strategies with increasing stringencies were 
analyzed and compared: (i) lowest stringency: 14 clones analyzed, (11) medium . 
stringency: 25 clones analyzed (see Table 2), (iii) highest stringency: 41 clones analyzed * 
from various passages. The last passage (PI 2) yielded a population dominated by a 
single pair of coiled-coil sequences, WinZip-AI 81, as described above (Fig. 10, clone 
#1 in Fig. I D), which was biophysically analyzed (see beiow), The sequences of clones 
surviving at least up to passage 10 are reported in Figure ID: By comparing the 
selected clones from the three strategies, we analyzed the preferences for the core a- 
position, the distribution of e/g-pair . combinations, and the presence of any bias for 
certain amino acids within the individual helices. v 

2.2 Selection in the core a-positlon 

Sequencing of 16 clones prior to selection showed an equal distribution of Asn and Val 
at the core a position. After selection a strong bias was found toward Asri-pairs, which 
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became stronger with increasing selection stringency. No bias was seen between Val- 
val or Val-Asn combinations. An Asn-pair was found in 57% (h=14) of the lowest 
stringency, in 92% (n=25) of the medium and in 100% (n=41) of the highest stringency 
selection. Thus, the specificity gained by this polar interaction clearly outweighs the 
more energetically stable Val-pairs. this seems to be a very important feature, since a 
strong selection occurred even at the lowest stringency. , 

2.3 Selected e/g-pairs In hetero- and putative homodimers- 

All selected elbnes must form a heterodimer to reconstitute DHFR activity, however; the 
heterospecificity, and thus the ratio of heterodimers (libraryA/libraryB) to hombdimers 
(libraryA/libraryA and libraryB/libraryB) can vary, and a mixture of both may still generate 
sufficient amounts of active enzyme to allow cell propagation linder low stringency 
conditions. We analyzed the average occurrence of all e/g-pair combinations in the 
heterodimers'and the putative homodimers arising from the various selections in relation 
to the randditi statistical distribution (Fig. 6A). The selected heterodirners show on 
. average more attractive and less repulsive interactions than expected in a randohi 
populatidn, indicating selection for stability. This trend, although increasing with higher 
stringency., is already observed in the .lowest stringency selection, indicating that a 
certain threshold of stability is needed to induce enzyme dimerization. Selection for 
heterospecificity is achieved by a higher stability, of the heterodimer relatively to the 
homodimers and is only observed in the. medium and higHest ' stringency selections 
using the destabilizing 1114A DHFR[2].mutant (compare Fig. SA. d) vs (ii) and (iii)). . 
interestingly, this effect is more pronounced for llbraryA homodirners than for libraryB 
homodimers. and the biophysical charaicterlzation of WinZip-AIBI indicated that the 
libraryA homodimer is more stable (see below), and thus might have a stronger 
influence on titrating out fragments than the libraryB homodlmer. 
To determine the degree of heterospecificity achieved in the various selections, the 
relative stability of heterodimer versus homodimer was estimated for each single clone. 
We calculated the difference of attractive or repulsive interactions, respectively, between 
the heterodimer and the average of the two corresponding homodimers and displayed a 
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histogram of pairs as a function of this difference (Fig. SB). The results clearly show that 
heterdspecificity is achieved not only by a decrease of repulsive interactions but also by 
an increase of attractive interactions in the heterodimer relative to the homodimers. 
However, the lowest and medium stringency selection yielded still a certain fraction of 
pairs with no difference. or even a reversed ratio, whereas the highest stringency 
selection exclusively yielded pairs with distinct heterospecificity. In addition, in no case 
were morfe than three repulsions found in the heterodimers, although in a random 
combination 8% of all pairs shdulf have 4 to 6 repu!^^^^^^ 

2.4 Positional distributiori of the selected arnino acids 

Intrahelical electrostatic interactions can in^^^^ and may even promote - 

selection of apparently repulsive e/g-pairs: Interactions with the helix macrodipole, for 
example, can modulate stability in coiled coils, (37). Indeed, we observed a bias for y 
negatively charged and neutral amino acids in the N-terminal part and positively charged 
amino acids in the C-terminal part (Fig. 7). This positional preference may at least 
partially compensate the loss of stability resulting from a repulsive e/g-lnteraction. .In 
addition, interactions with adjacent residues on the outside, of the helix (b- arid c 

- positions) may influence the cdntributions of charges at the e^ ^nd g positions (38). This 
may explain , why at pdsitidn el in libraryB a negatively charged amino acid Is not 

• favored, contrary to the expected counterbalancing of the helix dipole. since this position 
is adjacent ?t6 two aspartates in positions b1 and b2. On a more general note, the 
predominantly selected, sequence with the residues from c-Jun at the outer positions 

, (libraryA) bears remarkable similarity to the e. and g sequences in the naturally- 
occurring c-Jun. ., . . 

2.5 Library complexity 

Although the predominantly selected sequence pair WinZip-AI 81 showed all the desired 
properties in vivo as well as in vitro (see below), we were not able to cover all theoretical 
library vs-library combinations in our selection. Nonetheless, we covered all possible 
electrostatic interaction combinations (+/+. -/-. +/-. +/n, -/n, n/n; n=neutral) in all six 
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interacting e/g-positions. when grouping the core a-position into favored (Asn-Asn) or 
disfavored (Asn-Val. Val-Val) combinations. This reduces the theoretical library size from 
1.7x10'° to 9x10* possibilities, which was well covered by the experimental library of 
2x10^ It is therefore, likely that WinZip-A1B1 contains the most important features for 
stability and heterbspecificity. Furthermore, the random probability of finding pairs with 
no repulsive interactions was 1:40. and with solely attractive interactions was 1:1. 6x1 0^ 
Thus, our selection covered a representative sequence space and the same-charged 
interactions in WinZip-AI BI are not a i-esult of incomplete library sampling but must 
have more, subtle reasons, including in-vivo factors, which we cannot fully address. 
Furtherrhore. in the medium as well as in the highest stringency selection 13 out of 38 
pairs sequenced had no repulsive e/g pairs, but none competed successfully against. 
WinZip-AIBI in tfie selection. 

Example 3: Biophysical characterization of WinZip-A1.. WinZip^BI and WinZip- 
A1B1 

3.1 Secondary structure and oligomerization state of the predominant pair 
WinZip-A1B1 

we investigated the stability and specificity of the predominantly selected peptides 
WinZip-AI and WinZip-B1 alone and in an equimolar mixture (WinZip-AIBI). All 
experiments were performed with N-acetylated and C-amidated synthetic peptides. 
311 Peptide synthesis and purification 

The peptides WinZip-AI: Ac-STTVAQLEEKVKtLRAQNYELKSRVQRLREQVAQLAS- 
NH2 and WinZip-B1: AC-STSVDELQAEVDQLQDENYALKTKVAQLRKKVEKLSE-NH2 
were synthesized (Applied Biosystem 431A) and purified by reversed^phase HPLC. 
Electrospray mass spectrometry confirmed purity and identity of the peptides with a 
mass deviation of less than 1 Da. Peptide concentrations were determined by tyrosine 
absorbance in 6 M GdnHCI (39). 
3.1.2 Circular dichroism measurements 
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All peptides formed stable a-helical coiled coils as demonstrated by CD-spectra. 
CD spectra were recorded at S^'C at a total peptide concentration of 150 /jM (1 mm 
cuvette, Aviv 62DS spectrometer). The standard buffer was 10 mM K2HPO4/KH2PO4, pH 
7.0. 100 mM KF; salt concentration and pH were varied as indicated in the respective 
experiments. Thermal denaturations were measured at 222 nm in steps of 2.5''C (2 min 
equilibration, 30 s averaging). Thermal transitions were >91% reversible except where 
indicated. Apparent Tm were determined by least-squares curve fitting of the 
denaturation curves (40), assuming a two-state model. ATm. was calculated as 
Trti(WinZip-A1B1)-y2[Tm(WinZip-A1)+Tm(WinZip-B1)^ Urea denaturation equilibria were 
determined at 20°C by automated titration of native peptide with denatured peptide in 6 
M urea (30 /yM WinZip-AI or WinZip A1B1, respectively, or 60 //M WinZip-Bl) 
measuring the CD signal at 222 nm (300 s equilibration, 30 s averaging). Kp values: were 
calculated by linear extrapolation to 0 M denaturant assuming a two-state model 
(KD=[unfolded mohomerf/[folded dimer]). 

The helical content was in the range of 90% (WinZip B1) to 100% (WinZip-Ai and 
WinZip-AIBI). Peptide WiriZip-AI as well' as the mixture WiniZip-AIBI (Fig. 4A) were 
dimeric at 10°C and 25X over a concentration range from 10 //M to 150 )l/M as 
determined by equilibrium sedimentation. WinZip-BI was partially unfolded as seen both 
by CD (Fig. 8C at 0 M urea) and equilibrium sedimentation. - 

3. 1 .3 Equilibrium sedimentation. 

Equilibrium sedimentation experiments were performed using a Beckman XL-A 
Ultracentrifuge. Absorbance was monitored at 220 and 275 nm at peptide 
concentrations of 10, 50 and 150 //M in 10 mM K2HPO4/KH2PO4, pH 7.0. 100 mM KCI. 
Partial specific volumes and solvent densities were deterrriined as described (41). The 
data sets were fitted to single molecular masses of monomer, dimer and trimer. 
Equilibrium sedimentation indicated a mixture of monomers and dimers, with decreasing 
amount of dimer at increasing temperature. 

3.1.4 Structural stability and heterospecificity 

Thermal denaturation studies at neutral pH (Fig* 9A) revealed apparent Tm values of 
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ZS^'C (WinZip-B1), 49°C (WinZip-A1). and 55°C for the equimolar mixture of both 
(WinZip-AlBI). The large difference between the denaturation curve of the heterodimer 
and the average of the curves from WinZip-AI and WinZip-B1 indicates that 
heterodimers form preferentially at equilibrium. This high heterospecificity is best 
reflected in a large and poisitive ATm value, and indeed, we observed a ATm of IB.S'^G. 

To probe the mechanism of specificity, the effects of pH (Fig. 9B) and ionic strength (Fig. 
9C) were investigated. All peptides were more stable at high pH, most likely because all 
have at least one e/g^pair with two positive charges which are neutralized at high pH. 
The increased stability of WinZip-BI at low pH could be du6 to the shielding of 
electrostatic repulsions resulting from its high concentration of acidic residues. However, 
the Aim is positive over the whole pM range indicating heterospecificity. The maximum 
degree of heterospecificity was observed at neutral to slightly basic pH, consistent with 
the intracellular pH of E. coli (42). High salt concentrations increased the absolute Tm 
values (Fig. 9C), presumably due to increased hydrophobic interaction or reduced 
electrostatic repulsion. However, the ATm is reduced compared to low salt 
concentrations (0-100 mM), most likely due to the decreased influence of ionic 
interactipns at higher ionic strength. ; 

Interestingly, the overall stability did not correlate directly with the number of potentially . 
repulsive e/g-interactions. The homodimer WinZip-BI has two same-charged ion pairs, 
but is significantly less stable than the homodimer WinZip-A1 with four same-charged 
pairs (Fig. 9). Since the overall helical propensity is comparable for both peptides 
according to (43), the difference is probably due to intrahelical interactions. LibraryB 
rfiight be destabilized by its high local concentration of acidic residues at the N-terminus. 
This may also explain why libraryA homodiniers have generally more repulsive and less 
attractive e/g-interactions than libraryB homodimers, since the e/g-pbsitidns play a 
' bigger role in the destabilizatipn of the intrinsically more stable libraryA in order to 
reduce homodimerization. 
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3.1.5 Native gel electrophoresis 

Heterospecificity also was observed by native gel electrophoresis (Fig. 8B). 
Gels (7.5% polyacrylamide (19:1), 375 mM B-aianlne acetate buffer. pH 4.5) were run 
with 500 mM B-alanine acetate buffer, pH 4.5. Samples (~1 0 ^/g peptides per lane) were 
two-fold diluted with 600 mM B-alahine acetate. pH 4.5. 0.2% methylgreen. 10% 
■ glycerol. Gels were prerun at 100 V tor at least 45 min, ran for 2-3 h at 5°C. and fixed 
with 2% glutaraldehyde or 20% (w/v) TCA, respectively, before staining with Coomassle 
blue.' ."■ 

To obtain a significant migration, an acidic buffer (pH 4.5) had to be used, arid.thus 
conditions where the heterospecificity is the. lowest (ATm of only 7''.C, compared to . 
. 16.5°C at neutral pH. Fig: 9C). Nevertheless, even uncliBr these stringent conditions 
heterodirriers were obtained almost exclusively from the equimolar mixture (Fig. 8Bi). 
suggesting very high heterospecificity at neutral pH, and thus indicating how strongly 
heterospecificity was selected for. 

3.1.6 Kd determination 

Dissociation constants of the peptides were derived from equilibrium urea denaturatipns 
(Fig. 8G). The heterodimer WinZip AlBI was the most stable species with a Kd of 
approximately 24 nM, while the homodimer WinZip-At had a Kd of approximately 63 nM, 
the accuraicy of the Kd. determination of WinZip-BI is lower since it Is already partially 
unfolded without denaturant (see above). The Kd was estimated to be in the 10'^ M 
range. Calculations were confirmed by determining the Kd . values from thermal 
denaturatipn curves by a van'l Hoff analysis, assuming as a first approximation a 
constant aH (40). We found reasohable agreement to the data obtained by urea 
denaturation Awith a maximal deviation of Kd by a factor of 2.6. 

3.1.7 Comparison to other coiled coils 

Designed coiled coils aire usually only judged for being stable in vitro and. in certain 
cases, for heterospecificity, whereas naturally-occurring coiled coils must also function 
reliably in a cellular environment. Similar demands are imposed on our selection and on 
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further in-vivo applications, and therefore WinZip-AIBI is best compared with other 
naturally-occurring coiled coils. The homodimeric coiled coil of the yeast transcription 
factor GCN4 has an equal or slightly higher Tm, depending on the length and 
concentration of the peptides chosen (44. 45). The N terminal homodimeric coiled coil of 
the APC protein has a Tm lower by at least 9°C than WinZip-AIBI (46). The 
heterodimeric coiled coil from c-Jun/c-Fos shows comparable Tm and aT^ values (2). 
However, those data were derived from disulfide-bridged peptides. The coiled coil from 
c-Myc/Max also heterodimerizes to a fairly high extent, but peptides of comparable 
length have a Tm pf only srC and a Kd of 60juM (as^C) (45), whereas our WinZlp-A1B1 
has a Tm of 55''C and a Kb of approximately 24 pM (20°C). Thus, WlnZip-A1B1 
compares successfully with naturally-occurring coiled coils and will therefore be very 
useful for a variety of in-vivo applications. 

Example 4: Chain shuffling of the WinZip-AI 81 sequences 
3.1 Introduction 

In the above experiment, WinZip-AIBI was selected from a sample representing 2.0 x 
10^ library- vs-library cotransformants. As the theoretical library-vs-library diversity is 
(1.31 X 10^)^= 1.72 X 10^°, approximately 0.01% of the library-vs-library space was 
sampled. However, we obtained a very high coverage of either single library (theoretical 
complexity of 1.31 x 10^), where the probability of all members being present at least 
once is P=0.973. Thus, each polypeptide sampled only a small portion of the opposite 
library (2.0 x 10^/ 1.31 x 10^ = 15.4 polypeptides of the other library with P=0.999, 
assuming equal transformation rates for both libraries) and it is likely that better 
combinations for the WinZip-AIBI peptides may be found. Using WinZip-A1B1 as a 
partially optimized starting point, we combined each of the two WinZip-AIBI 
polypeptides with the opposite library (WinZip-A1-DHFR[1] + LibB-DHFR[2:l114A] and 
WinZip-B1-DHFR[2:l1 14A1 + LibA-DHFRI1l). 
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3.2 Chain shuffling 

DNA from the WinZip-AI B1 clone was isolated and retransformed into bacteria in order 
to obtain clones carrying either plasmid WinZip-A1-DHFR[1] or WinZip-B1- 
DHFR[2:I114A]. A pure clone (for. each) was electroporated with the appropriate library. 
Library representation was calculated by comparison with control transformations of the 
same cells with DNA from the other WinZip-AI B1 polypeptide (calculated as the number 
of colonies growing in the presence of trimethoprim divided by the number growing in 
the absence). 

Single-step selection yielded pre-selected pools for either competition. In both cases, 
the library (1:3 x 10^) was over-represented by a factor of 24 and 14^ respectively, and 
the probability that all members were present at least bnCe as partners of the "constant" 
peptide is P > 0.999 and 0.882, respectively. With passages of selection competition, a 
clear increase in colony sizes was again observed, indicating that faster-growing clones 
were taking over (Fig. 3C). . . , 

3.2 Analysis 

At PO and each second passage, DNA from the entire pool of cells was sequenced in 
order to follow the rate of evolution of each library against a constant partner. Figure 10 
illustrates the results from representative semi-randomized positions. It is clear that the 
rate of selection Is not constant at all positions: some positions showed a dominant 
residue ( > 50%) already at P4 and clear selection ( > 90%) at P6 (see position e2) while 
others remained mixed (<50%) until P6 and became clear only at PI 0 (see position gS). 
This was observed in both selections. The sequences from individual colonies were 
analyzed, in both selections, a predominant clone was identified (Table 1 and Fig. 4C), 
which is similar, but not identical, to the originally selected WinZip-A1B1 pair. The 
selection of the predominant clone WinZipA2B1 (selection of LibA-DHFR[1] against 
WinZip-B1- DHFRI2:I114A]) was achieved before P10. as P10 (4 clones analyzed) and 
P12 (4 clones analyzed) revealed only this clone. The selection of the predominant 
clone WinZipA1B2 (selection of UbB-DHFR[2:l114A] against WinZip-A1-DHFR[1l) was 
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Clear but not complete after 12 passages, as it was identified 4/6 times in PI 2 and 3/5 
times in PI 0. 

Analysis of the biophysical properties of the peptide pairs selected in the chain shuffling 
experiments, performed as described in Examples 2 and 3. indicated that the observed 
e/g-interaction pattern is similar to that of WinZip-A1B1. 

Conclusioh 

We applied a fast and simple strategy, a library-vs-library selection with the fragment 
complementation assay, to select for a metabolically stable, dimeric and highly 
heterospecific coiled coil with high affinity. Comparison of the biitcome of various 
selections performed with different stringencies revealed insight into which properties 
are selected already at lower stringency, and are thus the most crucial for successful 
heterodimerization. and those which only become apparent at higher selection 
stringency, and thus represent a more subtle optimization. The most striking selection 
occurred at the core a-position for Asn-pairs. revealing that structural uniqueriess is 
essential for efficient and selective heterodimerization. Furthermore, comparison of 
selected e/g-pajrs from hetero- and putative hpmodimers indicated selection for stability 
even at the lowest stringency, whereas selection for heterdspecificity was more 
pronounced at higher stringency. Heterospecificity was achieved not Only by decreasing 
the numbers of repulsive e/g-interactiohs but also by increasing the number of attractive 
interactions in the heterodimer relative to the homodimers. 

The selection for heterospecificity (and thus against homodimers) may be a unique 
feature of this selection system. Not only is active enzyme exclusively formed by parallel 
heterodimers, but homodimers and higher oligomers are likely to have a negative effect 
by unproductively wasting fragments and perhaps even harmfully accumulating non^ 
functional enzyme. DImer stability, in turn, is dependent not only on e/g-pair interactions, 
but also on helical propensity, intrahelical interactions and helix dipole stabilization. 
Indeed, our analysis revealed that the most successful variants do not simply consist of 
complementary charges in the e/g-positions. but show a more complicated pattern, 
presumably fulfilling a variety of naturally conflicting demands on the sequence, whose 
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optimum would have been extremely challenging to predict. 

The biophysical characterization of the predominantly selected pair WinZip-A1B1 
revealed the formation of a stable dimeric coiled coil with very high heterospecificity, 
confirming the results from the sequence analysis and the validity of the selection 
strategy. The results obtained with VVinZip-AIBI and the peptide pairs identified in the 
chain shuffling experiment, are supporting the view that idealized sequences, based on 
the single principle of merely relieving repulsive e/g-jnteractions in the homodimers with 
complementary charges in the heterodimer, may not be optimal for biological 
applications 
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CLAIMS 

1 . A method for the identification of hetero-associating (poly)peptides comprising the 
steps of: 

(a) providing a library A of (poly)peptides/proteins comprising (poly)peptides 
Am having the generail formula: 

VAQLXEXVKTLXAXZYELXSXVQRLXEXVAQL : 

wherein X repfesehts a mixture of E, K, Q, and R, and wherein Z represents a 

'mixture of N and V 

* ■ *' " ■ " • ' , . -*'■"' 

(b) providing a library B of (poly)peptides/proteins comprising (poly)peptides 
Bn having the generar formula: ' 

VPELXAXVDQLXDXZYALXtXVAQLXkXVEKL 

. / wherein X represents a mixture of E, K, Q, and B, and: wherein Z represents a 
mixtui-e of N and V; • . , 

(c) combining in a common medium the (poly)peptides/protein^ 
libraries A and B; and 

(d) screening or selectirig for a screenable or selectable property caused by 
the. hetero-association of a (poly)peptide Am with a (poly)peptide Bn- 

2. the method of claim 1 wherein said libraries A and B are provided by providing 
libraries of nucleic acid sequences encoding said (poly)peptides/proteins, followed 
by causing or allowing the expression of said libraries of (poly)peptides/proteins. 



wo 01/00814 



PCT/EPOO/05922 



40 

3. The method of claim 2 wherein said common medium are host cells, each cell 
harbouring nucleic acid sequences encoding a (poly)peptide/protein of each of said 
libraries A and B. 

4. The method of claim 3 wherein said (poly)peptides/proteins of said libraries A and 
B further cortiprise either a N- or a C-terminal fragment of the murine DHFR 
enzyme, and wherein said screenable or selectable property is insensitivity of the 

•host cell to trimethoprim by reconstitution of the DHFR enzyme on hetero- 
association of (poly)pepticles Am and Bn. 

5. A hetero-associating (poly)peptide Am taken from the list of: 

(a) WinZipAl: VAQLEEKVKTLRAQNYELKSRVQRLREQVAQL 

(b) WinZipA2: VAQLRERVKTLRAQNYELESEVQRLREQVAQL 
(C) WInZipAS: VAQLQEKVKTLRARNYELKSEVQRLEEKVAQL 

(d) ; WinZipA4: VAQLEEQVKTLQARNYELKSKVQRLKEKVAQL : 

(e) WinZipAS: VAQLEERVKTLRAQNYELKSKVQRLEEQVAQL 

(f) WinZipAS: VAQLEEQVKTLEAENYELKSKVQRLRERVAQL 

(g) winZipA7: VAQLQEQVKTLEAQNYELESEVQRLKEQVAQL 

(h) WinZipAS: VAQLEERVKTLKAENYELESEVQRLKERVAQL 

(i) WinZipA9: VAQLEEKVKTLKAKNYELKSKVQRLKEKVAQL 
G) WinZipAl 0: VAQLQEEVKTLQAENYELRSEVQRLEEEVAQL 
(k) WinZipAl 1:VAQLRERVKTLRARNYELQSKVQRLKERVAQL 

6. A hetero-associating (poly)peptide Bn taken from the list of: 

(a) WinZipBI: VDELQAEVDQLQDENYALKTKVAQLRKKVEKL 

(b) WlnZipB2: VDELKAEVDQLQDQNYALRTKVAQLRKEVEKL 

(c) WinZipBS: VDELEAEVDQLKDQNYALKTKVAQLQKQVEKL 

(d) WinZipB4: VDELRAKVDQLQDENYALETEVAQLQKRVEKL 

(e) WinZipBS: VDELEAEVDQLEDQNYALQTRVAQLEKRVEKL 

(f) WinZipBS: VDELKAKVDQLKDKNYALRTKVAQLRKKVEKL 
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- (g) WinZipB?: VDELRAQVDQLQDKNYALRTRVAQLKKRVEKL 

(h) WinZipBS: VDELQAEVDQLQDQNYALRTQVAQLKKKVEKL 

(i) WinZipB9: VDELRAQVDQLEDQNYALETQVAQLEKEVEKL 
G) WinZipBIO: VDELQAKVDQLKDENYALQTKVAQLQKRVEKL 
(k) WinZipBH: VDELRAEVDQLEDENYALRTRVAQLRKQVEKL 

7. A method for the identification of optimized hetero-associating (poly)peptides by 
using a hetero-associating (p6ly)peptide of claims 5 or 6 in a method according to 
anyone of claims 1 to 4, wherein a hetero-associaiting peptide WinZipAm of claim 5 
is used instead of library A of (poly)peptides/proteins comprising (poly)peptides Am 
in step (a) of claim 1 , or wherein a hetero-associating peptide WinZipBn of claim 6 
is used instead of library B of (poly)peptides/proteins comprising (poly)peptides Bn 
in step (b) of claim 1 . 

8. An optimized hetero-associating , (poly)peptide obtainable by the method of claim 7, 

9. A pair of hetero-associating (pbly)peptides taken from the list of: 



(a) 


WinZipAl and WinZipBI 


(b) 


WinZipA2 and WinZipBI 


(c) 


WinZipAl and WinZip B2 


m 


WinZipA3 and WinZip 83 


(e) 


WinZipA4 and WinZip 84 


(f) 


WinZipAS and WinZip 85 


(g) 


WinZipAS and WinZip 86 


(h) 


WinZipA? and WiiiZip 87 
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(i) WinZipAS and WinZip B8 
(j) WinZipA9 and WinZip B9 
(k) WinZipAlO and WinZip BI O 
(I) WinZipAn and WinZip Bi t 

10. A (poly)peptide/protein connprising one of the hetero-associating (poly)peptides of 
claims 5 or 6, or an optimized hetero-associating (poly)peptide of claim 8, and a further 

. (poly)peptide/protein. 

11. A (poly)peptide/protein of claim 10 wherein said further (poly)peptide/protein. is an 
enzyme, a toxin, a cytokine, a metal binding domain, a trahscription factor, a member of 
the immunoglobulin superfamily, a bidactive peptide of 5 to 15 arnino acid residues, a 
peptide hormone, a growth factor, a lectin, a lipoprotein, a peptide which is able to- bind 
to an independent binding entity, or a functional fragment of any said further 
(poly)pieptide/protein. 

12. A hetero-associated (poly)peptjde/protein comprising at least two 
(poly)peptide/proteins of claims 10 or 11, associated by hetero-association of a hetero- 
associating (poly)peptide Am and a hetero-associating (poly)peptide Bp. 

13. A DNA sequence encoding a hetero-associating {poly)peptide of claims 5 or 6, an 
optimized hetero-associating (poly)peptide of claim 8. or a (poly)peptide/protein of 
claims 10 or 11. 

14. A DNA sequence encoding a hetero-associating (poly)peptide wherein said DNA 
sequence hybridizes under stringent conditions to a DNA sequence encoding a hetero- 
associating (poly)peptide of claims 5 or 6. 
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15. A vector comprising a DNA sequence of claim 13 of 14. 

16. A vector comprising DNA sequences encoding at least two (poly)pepticie/proteins 
of claims 10 or 11, comprising at least a hetero-associating (poly)peptide Am and a 
hetero-associating (poly)peptide Bn. 

1 7. A host cell containing at least one vector of claims 1 5 or 1 6. 

18. A host cell of claim 17 which is a rnammalian, preferably human cell, a yeast cell, 
an insect cell, a plant cell, or a bacterial, preferably E.coli cell. . 

19. A method for the production of a hetero-associating (poly)peptide of claims 5 or 6, 
an optimized hetero-associating (poly)peptide of claim 8. a (poly)peptide/protein of 
claims 9 or 10, or ahetero-associated (poly)peptide/protein of claim 12, which comprises 
culturing the host cell of claims 17 or 18 in a suitable medium, and recovering said 
(poly)peptide or said (poly)peptide/protein produced by said host cell. 

20. A pharmaceutical composition comprising the hetero-associated 
(poly)peptide/protein of claim 12. 

21. A diagnostic composition comprising the hetefo-associated (poly)peptide/protein 
of claim 12. ^ 

22. A kit containing at least one of . 

(a) a hetero-associating (poly)peptide of claims 5 or 6, an optimized hetero- 
associating (poly)peptide of claim 8, or a (poly)peptide/protein of claims 1 0 or 1 1 , 
or a hetero-associated (poly)peptide/protein of claim 12; or 

(b) a vector of claims 1 6 or 1 7. 
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Figure 1 : 
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Figure 2: 
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Figure 5: 
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Figure 6: 
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Figure 8: 
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Figure 11: 



(A) 

WinZipAl: vaqleekvktlraqnyelksrvqrlreqvaql 

WinZipA2:' vaqlreryktlraqnyelesevqrlreqvaql 

WinZipAS: vaqlqekvktlrarnyelksevqrleekvaql 

WinZipA4: . VAQIaEEQVKTLQARNYELKSKVQRLKEKVAQIi 

WinZipAS: - vaqleeryktlraqnyelkskvqrleeqvaql 

WinZipA6: VAQLEEQVKTLEAENYiELKskvQRLRERVAQL 

WinZipA7: vaqlqeqvktleaqnyelesevqrlkeqvaql 

WinZipAS: vaqleervktlkaenyelesevqrlkervaql 

WinZipA9: . vaqleekvktlkaknyelkskvqrlkekvaql. 
WinZipAl 0: " vaqlqeevktlqaenyelrs evqrleeevaql . 

WinZipAl 1 : VAQLRERyKTLRARNYELQSKVQRLKERVAQL 



WinZipBI: VDEtQAEVDQLQDiENYALKtKVAQLR 

WinZipB2: - . vdelkae vdqlqdqnyalrt kvaqlrkeyeiQj 

WinZipB3: ; vdeleaevdqlkdqnyalktkvaqlqkqvekl 

VVinZipB4r vdelrakvdqlqdenyaletevaqLqkryekl 

WinZipBS: . VDELEAEVDQLEDbNYALQTRVAQLEKRVEKL 

WinZipBB: VPELKAKVDQLKDKNYALRTKVAQLRKKVEKL 

WinZipB?: • vdelraqvdqlqdknyalrtrvaqlkkrve^ 

WinZipBSr VDELQAEVDQLQDQNYALRTQVAQLKKKyEKL 

WinZipB9: vdelraqvdqledqnyaletqvaqlekevekl 

WinZipBI 0: vdelqakvdqlkdenyalqtkvaqlqkrvekl 

WinZipB1 1 : vdelraevdqledenyalrtrvaqlrkqvekl 



11/11 



